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ABSTRACT 


'Two  additive  stage  processing  models  are  presented  that  address  some  of  the 
tasks  reported  in  Antos,  Kozminsky,  Bourne,  and  Kintsch  (1981).  The  development 
and  application  of  these  models  to  information  analysis  and  decision  procedures 
demonstrates  their  usefulness  in  structuring  thinking  and  experimental  design  anout 
critical,  complex,  real-life  sorts  of  decisions.  Both  alternative  processing 
models  presented  were  constructed  by  carefully  thinking  out  what  mental  operations 
are  necessary  to  perform  information  analysis,  decision  and  verification.  Pre¬ 
dictions  are  generated  from  the  models  and  those  predictions  were  compared  to  the 
data  reported  in  Antos,  et  al.  While  some  predictions  of  the  model  are  supported, 
new  experiments  are  needed  to  test  more  critical  predictions.  The  intention  of 
this  work  is  to  lay  some  of  the  theoretical  groundwork  for  designing  experiments 
that  would  provide  information  on  the  critical  elements  in  how  people  understand 
facts  in  order  to  make  decisions.  _ 


This  report  presents  additive  stage  process  models  which  address  several 
of  the  tasks  used  in  experiments  reported  by  Antos,  Kozrainsky,  Bourne,  and 
Kintsch  (1981).  These  models  make  use  of  the  principles  embodied  in  several 
familiar  models  of  sentence  verification  (e.g.,  Carpenter  &  Just,  1975).  The 
theoretical  character  of  such  models  has,  furthermore,  been  carefully  examined 
by  S.  Sternberg  (1969).  The  role  of  the  models  is  to  specify  the  mental 
operations  necessary  for  the  performance  of  the  tasks  in  question,  mainly 
decision  or  verification  tasks.  Models  of  this  sort  generate  predictions 
concerning  the  relative  time  that  a  subject  should  need  to  respond  in 
different  experimental  conditions.  To  the  extent  that  these  predictions  are 
confirmed  by  the  corresponding  data,  the  models  are  viewed  as  credible.  The 
immediate  goal  is  to  present  a  few  models  of  this  sort  in  order  to  illustrate 
their  essential  principles  and  assumptions.  It  will  then  be  possible  to 
compare  the  predictions  generated  by  the  models  with  data  already  collected. 

Table  1  provides  an  overview  of  all  four  sessions  in  the  experiment. 
Table  2  shows  the  basic  design  of  texts  in  the  decision  task.  Table  3  shows 
an  example  of  an  assembled  text.  Finally,  Table  4  shows  the  basic  design  of 
texts  used  in  the  value  verification  procedure.  For  further  methodological 
details,  the  reader  is  referred  to  Antos,  et  al.  (1981). 


Insert  Tables  1-4  About  Here 
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Decision  Task 

As  discussed  on  page  20  of  Antes,  et  al.,  the  reaction  time  measures  RT1 
and  RT2  mean  different  things  for  RA  and  RB  conditions.  Because  RT2 
represents,  in  both  cases,  the  phase  at  which  a  response  is  finally  made,  it 
is  of  greater  interest  in  the  current  discussion.  For  the  RB  group,  RT2 
refers  to  the  time  needed  to  read  the  text  and  make  a  decision,  while,  for  the 
RA  group,  it  refers  to  the  time  needed  to  read  the  rule  and  make  a  decision. 
Reading  the  whole  text  involves  many  more  mental  operations  than  reading  the 
rule,  resulting  in  much  higher  variance  (as  well  as  magnitude)  for  the  RB 
group's  RT2  scores.  For  this  reason,  it  is  convenient  to  concentrate  on  the 
RA  group  at  this  time. 

While  terminating  versus  exhaustive  reading  is  a  strategic  option  in  the 
rule-before  condition,  the  rule-after  subject  has  little  choice  but  to  read 
each  sentence  of  a  passage  and  strive  to  store  the  categories  discussed,  along 
with  their  values.  It  turns  out,  nevertheless,  that  the  reader  may  proceed  in 
more  than  one  way.  First,  it  is  possible  that,  for  each  sentence,  the  subject 
stores  only  the  category  discussed  explicitly,  along  with  the  value  provided 
(e.g.,  capitalization — high).  Second,  it  is  possible  that  the  reader  computes 
the  value  associated  with  any  categories  correlated  with  the  three  explicit 
categories.  While  it  might  be  argued  that,  ignorant  of  the  rule,  it  would  be 
useful  for  the  RA  subject  to  compute  all  correlated  values,  it  is  certainly 
possible  to  delay  this  until  the  rule  is  presented.  .Among  other  things,  this 
would  reduce  the  subject's  memory  load  from  five  to  three  categories. 
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Let  us  examine  the  RT2  decision  scores  for  the  RA  group.  For  each  of  the 
BUY,  NOT  BUY,  and  INSUFFICIENT  INFORMATION  problem  types,  response  latencies 
differed  considerably  as  a  function  of  the  number  of  explicit  categories  (see 
Table  1).  If  the  RA  group  computed  the  value  of  correlated  categories  while 
reading  the  problem,  then  RT2  should  not  vary  as  a  function  of  the  number  of 
implicit  categories.  For  this  reason,  it  will  be  assumed,  for  now,  that  RA 
subjects  do  not  compute  the  value  of  correlated  categories  in  the  course  of 
reading  the  problem. 

Figure  1  shows  a  process  model  that  addresses  the  RA  group  decision  task, 
with  the  assumption  that  the  values  of  correlated  categories  need  to  be 
computed  at  decision  time. 


Insert  Figure  1  About  Here 


Stages  1  to  5  refer  to  the  reading  of  the  problem  and  thus  do  not  address 
RT2.  Let  us,  however,  briefly  consider  these  stages.  Stage  1  is  not  a  mental 
operation,  but  rather  refers  to  the  initialization  of  counters  for  "sentence 
within  the  passage"  (_s)  and  "relevant  categories"  (i).  The  model  of  Carpenter 
and  Just  (1975)  includes  such  counters.  Since  each  passage  consisted  of  three 
sentences,  s_  has  the  possible  values  0,  1,  2,  and  3. 

At  stage  2,  the  sentence  counter  is  incremented.  On  the  first  cycle, 
then,  s=l.  The  subject  then  encodes  the  sentence  to  its  propositional  form  at 
stage  3.  Since  the  subject  does  not  know  the  rule,  values  are  stored,  at 
stage  4,  for  each  category  mentioned  in  the  passage.  Finally,  we  arrive  at 
stage  5,  at  which  it  is  determined  whether  there  are  any  more  sentences  to  be 
read . 


..  .  _rj 
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The  portion  of  Model  A  that  addresses  the  decision  (and  RT2)  begins  at 
stage  6,  which  involves  the  reading  of  the  rule.  The  rule  mentions  two 
relevant  categories,  which  can  be  referred  to  as  R  and  R  .  At  stage  7,  the 
relevant-category  counter,  i,  is  incremented.  On  the  first  cycle,  then,  i 
equals  1. 

An  important  assumption  will  be  adopted  with  regard  to  the  remaining 
processing  stages.  It  states  that  the  subject  will  proceed  toward  a  solution 
in  the  most  logical  and  efficient  manner.  Let  us  briefly  consider  two 
implications  of  this  assumption.  (1)  Because  of  the  nature  of  the  conjunctive 
concept,  the  logical  subject  can  make  a  NOT  BUY  decision  upon  the  discovery  of 
just  a  single  relevant  category  that  is  negative.  For  this  reason,  the  model 
postulates  that  processing  may  terminate  as  soon  as  a  single  negative  relevant 
category  is  identified.  (2)  Similarly,  processing  may  terminate  in  an 
INSUFFICIENT  INFORMATION  decision  as  soon  as  it  is  discovered  that  no 
information  has  been  stored  for  a  single  relevant  category. 

Stage  8  is  a  test  which  asks  whether  a  value  has  been  stored  for  relevant 
category  R  .  (This  operation  resembles  one  shown  to  play  an  important  role  in 
sentence  verification  by  Singer  (1981)).  If  no  such  information  has  been 
stored,  it  is  asked  at  stage  9  whether  R  is  correlated  with  another  category. 
If  not,  it  is  established  that  no  information  is  available  for  R  .  Control 
then  flows  to  stage  12  at  which  a  response  index,  initialized  at  BUY  (Clark  & 
Chase,  1972)  is  changed  to  "insufficient  information."  That  response  is 


registered  at  stage  13. 
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If  it  is  established  at  stage  9  that  R  does  have  a  correlated  category, 
then  the  value  of  that  category  is  retrieved,  if  possible,  at  stage  9'.  If 
there  is  no  value,  control  again  flows  to  stages  12  and  13.  If  chere  is  such 
a  value,  however,  the  value  of  R  can  be  computed. 

Regardless  of  whether  the  preceding  operation  was  8  or  9',  it  is  asked  at 
stage  10  whether  the  value  of  R  is  negative.  If  so,  the  response  index  may 
be  immediately  changed  at  stage  12  and  a  NOT  BUY  response  registered.  If  the 
value  is  positive,  test  11  asks  whether  there  are  any  more  relevant  categories 
to  examine.  If  not,  then  it  is  established  that  both  relevant  categories  are 
positive:  a  BUY  response  is  registered  at  stage  13.  If  there  is  another 
relevant  category  to  examine,  i  is  incremented  (to  2)  at  stage  7.  Processing 
continues  in  the  way  just  described. 

Predictions 

It  is  easiest  to  illustrate  the  predictions  of  Model  A  within,  rather 
than  between,  response  types.  Consider  the  BUY  response,  for  example.  As 
discussed  elsewhere  in  this  report,  BUY  problems  differ  only  with  respect  to 
the  number  of  implicit  categories  they  include  (i.e.,  the  number  of  categories 
whose  value  must  be  inferred  from  a  correlated  category).  Model  A  specifies 
identical  processing  for  all  BUY  problems  except  in  that  each  additional 
implicit  category  requires  the  execution  of  stages  9  and  9'.  The  model 
clearly  predicts  that,  for  BUY  problems,  0-implicit  RT2  values  will  be  fastest 
and  2-implicit  values  will  be  slowest. 

The  actual  mean  correct  decision  times  for  BUY  problems  were  3.3,  5.9, 
and  4.8  seconds  for  the  0-,  1-,  and  2-implicit  conditions,  respectively.  The 
prediction  that  0-implicit  latencies  would  be  fastest  is  confirmed,  while  the 
1-  versus  2-implicit  prediction  is  reversed  in  the  data.  It  should  be 
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considered,  however,  that  the  low  correct  proportion  of  .56  casts  some  doubt 
on  the  usefulness  of  the  mean  value  for  the  2-implicit  condition.  The  reason 
for  this  is  that,  with  high  error  rates,  it  becomes  increasingly  likely  that 
the  proportion  correct  (i.e.,  .56)  includes  numerous  trials  on  which  the 
subject  did  not  perform  the  problem  as  specified  by  the  model  but,  rather, 
guessed  the  correct  answer  (see  Table  5). 


Insert  Tables  5-6  about  here 


Model  A  also  generates  a  clear-cut  prediction  for  the  INSUFFICIENT 
INFORMATION  problems.  It  states  that  decision  time  will  be  faster  for  the 
2-missing  problems  than  for  any  other  type.  The  reason  for  this  is  tnat  the 
subjects  who  have  grasped  the  problem  will  be  able  to  register  the 
INSUFFICIENT  INFORMATION  response  as  soon  as  a  single  category  is  discovered 
to  have  no  value.  The  processing  sequence  for  such  a  problem  would  be 
6-7-8-9-12-13  (see  Figure  1).  For  all  other  INSUFFICIENT  INFORMATION  problem 
types  (e.g.,  1  explicit,  1  missing),  there  is  a  probability  of  .5  that  the 
missing  category  would  not  be  examined  first,  resulting  in  longer  decision 
times.  The  actual  response  latencies  were  3.8  sec  for  the  2-missing  problems 
and  5.1  sec  for  all  other  problems. 

Verification  Task 

Recall  that  in  Session  4,  subjects  performed  a  verification  task.  After 
reading  an  antecedent  passage,  the  subject  had  to  judge  the  truth  of  a 
statement  that  a  particular  category  had  a  specific  value  (e.g., 
capitalization  was  h igh) .  This  task  is  addressed  by  Model  B,  shown  in  Figure 


2.  At  stage  1,  the  test  sentence  is  encoded  in  its  propos itional  form. 


Test 
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2  asks  whether  any  information  was  stored  for  the  category  discussed  in  the 
test  sentence.  if  no  information  was  stored,  test  2'  asks  whether  the  test 
category  is  correlated  with  any  other  category,  and  if  so,  whether  a  value  has 
been  stored  for  the  correlated  category  (test  2").  If  either  2'  or  2"  fail, 
control  flows  to  the  response  index  change  operation  4,  and  an  INSUFFICIENT 
INFORMATION  response  may  be  registered. 


Insert  Figure  2  About  Here 


If  a  value  is  available  (inverse  correlations  require  a  value  inversion 
at  stage  2""),  then  it  may  be  compared  with  the  test  value  at  stage  3.  A 
mismatch  results  in  an  index  change  to  NOT  BUY  at  stage  4,  while  a  match  leads 
to  a  BUY  response  at  5. 

Model  B  generates  a  variety  of  predictions  concerning  the  verification 
task  response  times.  For  categories  mentioned  explicitly,  it  states  that  NOT 
BUY  responses  will  be  longer  than  for  BUY.  This  is  because,  relative  to 
BUY's,  NOT  BUY  items  require  the  additional  response  index  change  at  stage  4. 
The  Model  B  also  predicts  that  the  verification  of  the  value  of  an  implicit 
(i.e.,  correlated)  category  will  exceed  explicit  ones,  as  long  as  the  subject 
has  not  computed  the  correlated  value  while  reading  the  problem. 

The  only  one  of  these  predictions  that  can  be  examined,  however,  is  the 
one  that  states  that  NOT  BUY  response  times  will  be  longer  than  BUY's. 
Collapsing  across  RA  and  RB,  BUY  responses  (response  type  1)  took  2214  msec  as 
compared  with  2642  msec  for  NOT  BUY  (see  Table  6).  Since  no  verification 
items  concerned  implicit  categories,  the  other  predictions  cannot  be 
inspected . 
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Possible  Experiments 

While  models  A  and  B  receive  at  least  some  support  from  the  decision  and 
verification  data,  it  will  be  difficult  to  evaluate  them  fully  without  further 
experimentation.  It  would  be  relatively  straightforward  to  design  experiments 
comparable  to  the  ones  discussed  here  that  would  more  incisively  inspect  the 
predictions  of  models  such  as  A  and  B.  The  following  are  some  examples. 

Decis ion  Task 

In  order  to  test  his  model  of  sentence  verification,  Singer  (1981)  used 
natural  language  materials  that  were  as  simple  as  possible.  The  rationale  was 
that  it  is  important,  at  least  at  a  preliminary  stage  of  an  investigation,  to 
eliminate  as  much  variance  as  possible  from  the  response  latencies.  For  the 
present  purposes,  certain  simplifications  are  similarly  possible. 

Consider  the  following  procedure  for  an  RA  group  only.  The  subject  reads 
the  passage  one  sentence  at  a  time,  instead  of  as  a  whole.  Reading  is 
self-paced,  with  the  reader  pressing  a  space  bar  to  view  the  next  sentence. 
Sentence  reading  times  are  recorded.  As  before,  the  passage  mentions  three 
categories,  including  at  least  one  relevant  and  one  irrelevant  one.  The 
sentences  are  simplified  as  compared  with  the  previous  materials:  instead  of 
presenting  a  complicated  statement  from  which  the  reader  deduces  the  value  of 
one  of  the  categories,  the  sentences  simply  assert  such  values  (e.g.,  earnings 
are  low) .  Immediately  after  the  last  sentence,  the  reader  sees  the 
conjunctive  rule,  makes  a  decision,  and  responds. 

Here  are  a  few  of  the  predictions  of  model  A  that  this  experiment  would 


address  . 
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(1)  For  problems  with  two  explicit  categories,  NOT  BUY  decisions  should 
be  much  faster  than  BUY.  This  is  especially  true  if  both  relevant  categories 


were  negative 

for 

the 

NOT  BUY 

problems , 

as  was  the  case  in 

previous 

experiments . 

NOT 

BUY 

dec  is  ions 

could  be 

signalled  immediately 

after  the 

discovery  of  the  first  negative  category. 

(2)  INSUFFICIENT  INFORMATION  decisions  are  predicted  to  be  very  fast, 
with  an  operation  sequence  from  the  reading  of  the  rule  of  6-7-8-9-12-13  (see 
Figure  1).  This  sequence,  of  course,  would  be  longer  if  the  missing  category 
was  the  second  rather  than  the  first  one  evaluated  by  the  reader.  The 
experiment  has  the  merit  of  examining  the  differential  predictions  that  the 
model  specifies,  for  example,  for  missing-explicit  versus  explicit-missing 
problems . 

(3)  Perhaps  the  most  interesting  issue  concerns  the  predictions  of  model 
A  for  the  BUY  and  NOT  BUY  problems  involving  implicit  categories.  The 
predictions  depend  on  whether  or  not  it  is  believed  that  the  RA  subjects 
compute  the  value  of  correlated  categories  when  they  read  the  passage.  If 
not,  their  decision  would  require  the  execution  of  operations  9  and  9'  every 
time  an  implicit  category  was  identified.  This,  in  turn,  would  lead  to  the 
prediction  of  longer  decision  times  (i.e.,  response  time  from  operation  6 
onward)  as  a  function  of  the  number  of  implicit  categories.  If,  on  the  other 
hand,  it  is  believed  that  correlated  values  are  computed  during  initial 
reading,  the  model  would  predict  decision  times  that  are  independent  of  the 
number  of  implicit  categories. 
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The  proposed  experiment,  in  fact,  provides  an  independent  source  of 
evidence  concerning  the  time  of  the  computation  of  correlated  values:  namely, 
the  initial  sentence-reading  times.  Computation  of  correlated  values  during 
initial  reading  should  produce  considerably  higher  reading  times  for  sentences 
dealing  with  correlated  categories. 

Verification  Task 

A  comparable  experiment  could  be  conducted  to  examine  Model  B  directly. 
Subjects  would  read  passages  including  either  two  or  three  sentences  that 
state  the  values  of  correlated  or  uncorrelated  categories  (e.g. ,  dividends  are 
high).  Immediately  after,  the  reader  would  view  a  test  item  to  verify.  The 
subject  would  register  responses  such  as  TRUE,  FALSE,  DON'T  KNOW,  using 
appropriately  labelled  switches. 

As  was  the  case  for  the  decision  experiment,  it  would  be  useful,  at  this 
stage,  to  keep  the  materials  as  simple  as  possible.  It  would  be  possible,  for 
example,  to  include  only  positive  correlations  to  link  categories.  This  would 
permit  the  elimination  of  stages  2'"  and  2""  (Figure  2).  Even  with  this 
simplification,  there  would  be  numerous  types  of  test  items,  including 
Explicit  Buy  and  Not  Buy,  Implicit  Buy  and  Not  Buy,  and  Insufficient 
Information  with  and  without  a  correlated  category.  The  model  specifies  the 
operation  sequence  for  each  problem,  from  which  sequences  important  ordinal 
predictions  are  derived. 
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General  Discussion 

Some  general  points  should  be  made  at  this  stage: 

(1)  The  evaluation  of  Models  A  and  B  with  reference  to  the  data  already 
collected  needs  to  be  considered  with  caution,  both  in  terms  of  successes  and 
failures.  As  has  already  been  pointed  out,  none  of  the  earlier  experiments 
was  devised  to  test  these  models.  The  earlier  experiments  included 
complicated  stimulus  materials  that  insured  the  high  variability  of  response 
times.  Numerous  interesting  predictions  of  the  models  could  not  be  examined 
at  all.  The  usefulness  of  such  models  toward  understanding  the  tasks  under 
consideration  can  only  be  established  by  means  of  experiments  that  more 
directly  address  the  models  themselves. 

(2)  The  data  of  the  proposed  experiment  will  not  be  used  simply  to 
examine  the  ordinal  predictions  of  the  model.  Rather,  it  will  be  possible  to 
submit  those  data  to  multiple  linear  regression  analysis,  using  the  number  of 
executions  of  each  hypothetical  operation  as  predictor  variables.  Such  a 
procedure  permits  the  determination  of  the  proportion  of  variance  among 
conditions  accounted  for  by  the  model,  goodness  of  fit,  and  parameter 
estimates  (i.e.,  the  magnitude  of  the  duration  of  the  component  operations). 
This  procedure  is  well  documented  (e.g.,  Carpenter  &  Just,  1975;  Schu. “ack  & 
Sternberg,  1981). 

(3)  It  is  not  asserted  that  these  models  can  account  for  all  of  the 
features  of  subjects'  performance  in  these  tasks.  One  outcome  that  is 
apparently  beyond  the  scope  of  the  models  is  the  fact,  mentioned  by  Antos,  et 
al.  (1981,  p.  21),  that  explicit  information  has  more  impact  than  implicit 


information  on  the  subject's  decision,  when  one  relevant  category  is  missing. 
As  a  second  example,  the  models  do  not  address  the  various  interactions 
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involving  the  consistency  variable  (Tables  8  and  9  of  Antos  et  al.).  It 
should  be  noted,  however,  that  these  outcomes  are  somewhat  reminiscent  of 
certain  semantic-relatedness  effects.  Rips,  Shoben,  and  Smith  (1973),  for 
example,  reported  that  it  takes  less  time  to  verify  a  robin  is  a_  bird  than  ja 
penguin  is  _a  bird .  Models  such  as  that  of  Carpenter  and  Just  have  typically 
not  accounted  for  such  findings  (although  they  could  conceivably  be  elaborated 
to  do  so) . 


Text  assembled  according  to  design  specification  of 
Trial  2  in  Table  1 


Sales  of  large-scale  data  processing  systems  are  substantial  in 
dollar  terms  and  are  expanding  modestly.  Increased  debt  to  capital 
ratio  severely  reduced  Ectex  cash  position.  Directors  will  meet 
next  month  and  there  is  speculation  about  a  stock  split  of  3  for  1. 


Design  block  example  for  Experiment  2.  The  signs  and  "+"  represent  categories  present  in  the  text  and 


+■>  4->  4->  CO  tO  (/> 


4->4->  (D<U4->4->a)aJlOOO(^ 

cc4->-*->cc4->+^cec: 
a>a>ooi/iai<U(/)i/)ooo 


ooccooccooo 
ooj,—  •  2:  z  z 


Not  Inconsistent 


letion  Verification 


parentheses  are  for  the  verification  task.  Those  not  in  parentheses  are  for  the  completion  task. 


Mean  Reaction  Time  in  Seconds  and  Proportion  Correct  for  Verification  Task 


Response  Types  Are:  Buy  and  Not  Buy 


FIGURE  2 
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